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We discuss the feasibility of seeing a Higgs boson which decays to four bottom quarks through 
a pair of (pseudo-)scalars at the LHCb experiment to argue that the use of &-physics triggers and 
off-line vertex reconstruction, as opposed to jet triggers with b tagging, may be more effective for 
this signal. Focusing on inclusive production for the Higgs, we find that for light scalar masses below 
20 GeV, signal reconstruction efficiencies of order a few percent may be enough for LHCb to find 
evidence for a Higgs with a dominant 46 decay channel. 



I. INTRODUCTION 

In the last decade, tests of the electroweak model at ac- 
celerators give indirect evidence that the standard model 
is the correct description of physics below 1 TeV and 
that it contains a Higgs boson with a mass not too far 
above current limits The Higgs boson itself has not 
been seen Q ■ Assuming the standard model is correct for 
scattering energies beyond 1 TeV, we know to reasonable 
precision the production cross section, decay width and 
branching ratios for any given Higgs mass. Thus search 
strategies can be optimized for such a Higgs. However, 
Higgs phenomenology can be very sensitive to physics be- 
yond the standard model. The Higgs width, at a mass of 
115 GeV, is about a thousandth of the Z boson width. 
Therefore, if there were neutral particles lighter than 
half the Higgs mass with any reasonable coupling to the 
Higgs, then the Higgs could decay dominantly into those 
particles, and the Z width would not necessarily be sig- 
nificantly affected. 

Here we assume that a decay into a pair of new light 
scalars dominates the Higgs width. The scalar, a, sub- 
sequently decays to a pair of bottom quarks, producing 
the signal h — > 46. Such phenomenology has been dis- 
cussed in the context of the NMSSM [3, [j| and could 
also be possible in little Higgs or composite Higgs theo- 
ries or any other such model where new light scalars (or 
pseudo-scalars) are coupled to the Higgs. 

This signal has in fact been searched for at LEP II, 
with a bound on the mass of the Higgs of 110 GeV as- 
suming a 100% branching ratio to the light scalars and as- 
sume the scalars always decay to a pair of 6 quarks, with 
weaker bounds for smaller branching ratios [6j]. Searches 
at hadron colliders are considered very difficult due to 
the large irreducible background from QCD production 
of 4 6 jets. It may be especially difficult at the ATLAS 
and CMS experiments [7|, l8( where triggers require high 
transverse momentum jets and/or leptons and thus the 
signal efficiency for what makes it to disk may be quite 
low. The strategy usually suggested is thus to look at 
Higgs production in association with a W boson decaying 
leptonically, even though the cross section is significantly 
lower than resonant production Q. 

In this note, we study the possibility of seeing the Higgs 
decay to 4 bottom quarks at the LHCb experiment [H|. 



Its lower running luminosity allows for less stringent trig- 
ger requirements and its smaller solid angle coverage al- 
lows for a higher rate of events written to disk than other 
LHC experiments. We find that for a masses below 20 
GeV there are simple kinematic cuts after which the sig- 
nal approaches the size of the 46 QCD background. For 
these lighter a masses, one is required to find two b de- 
cay vertices within a single cone, or two almost merged 
jets. The hope is that using non-standard vertexing tech- 
niques, such as topological vertexing [33| one may be able 
to seed calorimeter clusters with secondary vertices them- 
selves with a reasonable efficiency. In the next section, we 
estimate the total efficiency required, at different points 
in parameter space, to see evidence at 3 a for such a sig- 
nal after one year of running (assuming 2 fb _1 of data 
collected) at LHCb. 

In principle, with the right triggers, ATLAS and CMS 
may do as well or better than LHCb for this signal. We 
study LHCb because their planned inclusive muon trigger 
acts as an "existence proof that this data will be written 
to disk. Also, in inclusive production, the Higgs is typ- 
ically produced with a significant boost along the beam 
at the LHC, and thus at least one 6 quark often lives 
at large pseudo-rapidity, and therefore excellent tracking 
and vertexing in the forward region may be crucial for 
this signal. Finally, the possibility of doing useful energy 
frontier physics studies at the LHCb experiment could 
complement the ATLAS and CMS programs and should 
be explored. 

To mock up the large number of theories which contain 
this physics, the 'model' we are studying is simply the 
standard model with an additional real uncharged scalar 
a. The new scalar can couple to the Higgs doublet via, 
for example, 

-Xa 2 H^H. (1) 
2 v ' 

After electroweak symmetry breaking, this term will gen- 
erate a tree-level mass squared for a of m 2 a — (1/2) Aw 2 
and the trilinear coupling (l/2)Xvha 2 , where v ~ 246 
GeV is the Higgs' vacuum expectation value (vev). The 
cubic coupling allows (for a light enough a) the Higgs to 
decay into a pair of scalars, with a rate 

r(h->aa)~— Jl £ 2 
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When compared to the total decay rate of a (e.g., 115 
GeV) standard model Higgs, we see that for the above 
partial width to dominate the Higgs decay, the coupling 
to the scalar should be A > (.025). If the physical mass of 
a isn't a fine-tuned combination of different contributions 
(i.e., a cancellation to no less than 10%), then we should 
expect its mass to be above roughly 10 GeV if it is to 
dominate the Higgs width. 

The most natural way for the singlet to decay is in the 
same way as a standard model Higgs boson of the same 
mass. This would occur if the singlet mixes with the 
Higgs via a singlet vev or an operator a\H\ 2 . The singlet 
could also be Higgs-like if it mixes with other Higgs-like 
states. In the next-to-minimal supersymmetric standard 
model (NMSSM), a pseudo-scalar can be naturally light 
and mix with the heavier CP-odd part of the uneaten 
Higgs doublet. In any case, even a tiny mixing of this 
type would allow a, say, 20 GeV scalar a prompt decay 
rate, dominantly into 66. We study precisely this sig- 
nal setting the cross section of Higgs production to that 
of the standard model (at leading order) and the decay 
branching ratios of h — > aa — > bbbb to unity. The sig- 
nificances we quote scale with these quantities, for fixed 
backgrounds. Other decay options can be generated with 
additional new physics at the electroweak scale. See for 
example 



II. SIGNALS AND BACKGROUNDS AT LHCB 

The signal we studied was Higgs production with h — > 
aa and a — > bb. Signal events were produced and show- 



ered using Pythia v6.409 [ll|. Our analysis was per- 
formed for a number of different masses for h and a. The 
study was done for mh at 115, 130 and 145 GeV with m a 
between 15 and 35 GeV in 5 GeV steps. Pythia gener- 
ates the signal at leading order plus showering. At this 
order, the integrated cross-section for Higgs production 
at LHC energies is ~ 25pb for mu = 115 GeV, ~ 20pb 
for m h = 130 GeV, and - 15pb for m h = 145 GeV. 
Next-to-leading-order and higher corrections add ~ 60% 
to this [12J, though we do not include the enhancement 
as we also run the background at leading order. We as- 
sume branching ratios of unity for both the h — > aa and 
a — > bb decays. 

The background we include in this study is QCD pro- 
duction of four b quarks. QCD 26 production does not 
contribute significantly after double-counting of events is 
taken into account. Background events were produced in 
ALPGEN v2.10 [H and then showered through Pythia 
v6.325. Backgrounds not considered were those in which 
only three b quarks or fewer are produced in the accep- 
tance region (important when a c or light quark or gluon 
is mistagged as a & quark) 1 . We do not study these chan- 



nels as their importance depends sensitively on mistag 
rates, and using estimates of jet mistagging at the Teva- 
tron suggests that these contributions will not dominate 
the background 

The bb QCD background is potentially more significant 
in the kinematic region where the pairs of b jets from the 
individual scalar decays merge (i.e., when the scalars are 
light). QCD production of two b jets will be enormous 
at the LHC, with a cross section on the order of 1 mb - 
roughly a factor of 1000 larger than 46 production. Thus 
the technique for tagging two merged 6 jets better include 
a very small 1 6 to 2 6 fake rate (less than a few percent). 

It is expected that signal and background events will 
pass the level zero (L0) trigger at LHCb with high effi- 
ciency, while the recent upgrade of the LHCb read out 
rate to 2 kHz has permitted the addition of an inclusive 
high-level muon trigger. The inclusive branching ratio of 
6 hadrons to muons is ~ 11%, so given 4 6's per event, 
there is a ~ 36% probability that a given event will see 
the decay of at least one 6 hadron to a muon. The 6's 
from signal events are expected to be fairly hard for the 
forward acceptance range of LHCb pr (> 10 GeV) given 
the large invariant mass of the Higgs boson, so it seems 
reasonable to assume that > 50% of these muon decays 
will be triggered on. This trigger path by itself therefore 
gives an expected trigger efficiency of order or better than 
20% [HJ . Other high-level triggers which are already in 
place or could be designed for this purpose may increase 
this efficiency significantly and are worthy of study. 

To avoid generic QCD backgrounds, we require the 
tagging of all four 6 quarks. However, rather than first 
reconstructing jets and then applying a b-tag (the usual 
method) , we assume the four 6 vertices are found first us- 
ing a method such as 'topological vertexing' [33| . Then, 
the 6-hadrons are used as seeds of jets with cone-size 
R = y 5(f) 2 + Sr] 2 =0.6 (where Sr] and 8(f> are the spread 
in pseudo rapidity and azimuthal angle). When the 
jets overlap, we divide up the momenta of particles in 
the overlapping region between the two jets based on a 
smooth function of distance from the seeds, though in 
practice this detail is unimportant; pairs of overlapping 
jets are reconstructed into scalars, and only information 
about these reconstructed scalars is used as a kinematic 
discriminant. As a rough approximation, we take the 
triggering and tagging efficiencies as equal for signal and 
background and independent of kinematic configuration. 
We define £ = ^o/i-J^tag^trig) where £ is the signifi- 
cance, E ta g is the efficiency to tag all four 6 jets, (.trig 
is the trigger efficiency and So is the significance (de- 
scribed below) assuming e t rig — ttag = 1- Our results 
are described as an estimate the efficiencies required to 
generate a significance £ of 3. Oct. 

Beyond 6-tagging, we require all 6 jets to have px > 10 
GeV, and of course for all of them to live within the 



1 We are counting partons as opposed to jets as an important signal 



region occurs when pairs of b jets are partially merged. 
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FIG. 1: Reconstructed scalar mass versus reconstructed Higgs 
mass for 2 fb _1 worth of signal at different points in parameter 
space along with contour lines for background. Each contour 
(moving from left to right) represents a halving of the back- 
ground intensity, with the outermost contour at a background 
density (in m a vs. mj space) of ~ 128 events/GeV 2 given 2 
fb _1 of data. Note that the characteristic slope of the sig- 
nal distributions is relatively constant across a large region of 
parameter space. 



detector acceptance. LHCb's nominal acceptance range 
in pseudorapidity 1.9 < r\ < 4.3 on the vertical axis and 
2-1 < t] < 4.3 on the horizontal axis. 

Once these four b jets have been reconstructed, there 
are three possible ways to combine them into two pairs 
of jets in order to reconstruct the two a particles. Re- 
quiring the two pairs to have the same invariant mass 
within detector resolution turns out not to be a useful 
discriminant. The combination which is chosen is that 
which minimizes the quantity (Ri) 2 + (R2) 2 where Ri 
is the previously-defined distance in r\-<\> space between 
the first pair of jets and R2 is the distance between the 
second pair. It should be noted that this technique lim- 
its us to the study of relatively light scalars whose decay 
products tend to be collimated. Our method chooses the 
correct combination of jets > 90% of the time for all the 
previously mentioned sets of parameters other than with 
n%h = 130 GeV, m a = 35 GeV and mh = 115 GeV, 
m a = 35 GeV where the proper combination was recre- 
ated 88% and 76% of the time, respectively. For scalar 
masses closer to the upper kinematic limit (m n ~ mh/2) 
it is possible to recreate the proper jet combination mod- 
erately well (up to ~ 70% when m a > 0.48m?,,) by choos- 
ing the combination which maximizes (-R1) 2 + (R2) 2 ■ Un- 
fortunately, this method does not significantly reduce the 
background once an overall invariant mass cut is placed 
on the four jets. 

From each event, we have therefore reconstructed a 
Higgs mass and two a masses. The mean of these two 
a masses is taken as the a mass in all further analyses. 
When vrih is plotted against m a as in Figure Q] signal 
events generated by different sets of parameters tend to 



group into elongated ellipses. Events generated with a 
lower m a tend to be more tightly grouped than those with 
a higher mass, but the slope of the long axis of the ellipse 
seems to be independent of the location in parameter 
space and is ~ ^. See the appendix for details. This 
spread in invariant masses is a result of the jet (cone) 
algorithm we have chosen but does not take into account 
general smearing due to detector effects. It would be 
interesting to see if a nearly constant slope is maintained 
in a full simulation. 

The fact that this slope is fairly constant across pa- 
rameter space allows us to perform the following trans- 
formation: 



nih + 0.333m a 



(3) 



The resulting quantities q~ and q + are therefore roughly 
uncorrelated for a set of signal events from any point in a 
large region of parameter space. The standard deviation 
of q~ for such a set is of the order of 5 — 10 GeV, while 
that of q + is of the order of 20 GeV. 

To proceed, we center a box of size 15 GeV in q~ and 50 
GeV in q + at various points (a«, bj) on a grid in (q~ , q + ) 
space and count the number of signal and background 
events falling in each of these boxes. These numbers are 
used to estimate the significance for the central point of 
each box where the significance So = S/*/S + B, where 
S and B are the number of signal and background points 
in the box respectively. Adjacent central points are sep- 
arated by <ij-|_i — dj = 5 GeV and bj+% — bj = 10 GeV. 



TABLE I: Efficiency required, e req to see a 3. Oct significance 
with 2 fb _1 of data for various values of ran and m a along with 
the point (in q + and q~) at which the greatest significance is 
seen. The ratio of signal to background, S/B, included in 
this significance is also provided. The first four columns are 
in units of GeV. The required efficiency scales approximately 
with the inverse square root of the total luminosity. 
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peak. This is a result of the background dropping much 
faster toward lower q + in this region than the signal, also 
giving a significantly higher signal-to-background ratio. 
It is evident from these results that the difficulty of de- 
tecting this decay of the Higgs is fairly independent of the 
mass of the Higgs boson, while it is strongly dependent 
on the scalar mass. 

We present the results for one point in parameter space 
(m/j = 130 GeV, m a = 15 GeV) graphically in Figure [2] 
by restricting ourselves to events with 100 GeV < q + < 
150 GeV and histogramming these events by q~ into bins 
of size 5 GeV. The range of q + is that which gives the 
maximum significance for these parameters. 



FIG. 2: Number of events versus q~ for 2 ftV 1 worth of signal 
and background restricted to events with q + in the range 100 
GeV < q + < 150 GeV assuming mu = 130 GeV and m a = 
15 GeV. The range of q + represents the region of maximum 
significance for this signal. 



The boxes around neighbouring central points overlap, 
therefore their significances are correlated and cannot be 
independently combined. No trials factor is included. 
Table U provides a list of the points with the highest sig- 
nificances for various pairs of parameters (m a ,mh)- The 
significance of each of these points is presented as the ef- 
ficiency (ttrigg * (-tag) required to see a significance of 3.0c 
with 2 fb _1 (as an approximation, taking both signal and 
background to have identical efficiencies and that these 
efficiencies are independent of kinematic configurations) . 
It should be noted that in all cases the 15 GeV by 50 
GeV box with the highest significance captured a large 
fraction of signal events (ranging from 0.4 to 0.7, with a 
negative correlation between m a and the capture rate). 
Interestingly, for cases where m a was 15 GeV the highest 
significance was seen at q + slightly lower than the signal 



III. CONCLUSION: ESTIMATED REACH 

From the data it is apparent that given a relatively 
light pseudo-scalar and robust 6-tagging and triggering 
efficiencies the reach of this search technique extends 
across most or all Higgs masses where decays into pseudo- 
scalars could dominate (from the lower bound on the 
Higgs mass of ~ 115 GeV to somewhat less than twice 
the Z-mass). It is also apparent that for pseudo-scalar 
masses greater than 30 GeV detection of the Higgs using 
our technique is impossible. Logarithmically, the region 
of detectability 2mt, < m a < 30 GeV represents slightly 
more than half the range of masses where decays of Higgs 
bosons to b quarks could be expected to dominate (2to{, 
< m a < ^o*-. Achieving this reach is heavily dependent 
on efficient and accurate 6-tagging, especially in configu- 
rations where two b jets overlap. 
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Am Ql 
Am tt2 



m a 



(5) 



Boosting to the rest frames of the respective pseudo- 
scalars, 



Am ai ~ = 8 (p) 



Am n 



m a 

q'^'M 



Soil) 



(G) 



Remembering our definition of the measured scalar 

/ \ m a . (m.eas)-\-m a ~ (rneas) 

mass m a (meas) = — — — — — -, 



Am = Am ai + Am a2 ^ S' a {p) + S' a \q) ^ 

We now examine the error in the measurement of the 
Higgs mass: 

Am h =[(p^ + ^ + <y«(p) +*"(?)) 



-Kp" + ?")(?,- + <&.)]* 



(8) 



We again drop terms containing S 2 and approximate 



the square root to first order in 

membering that (p M + q^)(j>^ + q^) = (jrih) 2 and that we 
are in the rest frame of p M + q^: 



IV. APPENDIX: CORRELATION OF ERRORS 
IN THE MEASUREMENT OF SCALAR AND 
HIGGS MASSES 

Let the first scalar have momentum p^ and the second 
scalar have momentum in the rest frame of the higgs 
boson, while the error in measurement of each of these 
4- vectors is 5^{p) and 6 ti (q) respectively. The errors in 
measurement of the scalar masses are: 

Am ai = [(jf + 6»(p))(p, = 5^p))]i - [p%]5 
Am Q2 = [(<f+6»{q)){q ll + 5 M (q))]t ~ (4) 

Assuming that p^Pp, ~> p >1 S fl (p) 3> 5 ,l {p)5 fjL {p) we drop 
terms containing factors of S tJ/ (p)S fJi (p) and approximate 

the square root to first order in p p ^ p ) ■ We carry out the 
same operation on the second scalar as well, remembering 
that p^pp = q^q^ = (m a ) 2 : 



m h 

We now make the assumption that the major sources of 
error in the measurement of the masses derive from ultra- 
relativistic particles originating from the Higgs which es- 
cape the two cones around the component b hadrons of 
each scalar and other ultra-relativistic particles originat- 
ing in the underlying event which happen to enter these 
cones. 
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Given particle i from the underlying event entering the 
jet cone of a%, while T^(p) makes an angle of 4>i with p 
and a\ has a boost of j ai and velocity (3 ai relative to the 
Higgs, then 

n (p) = l ai T l0 {p){l + (3 ai cos fa) 



(11) 



Similarly, given particle j (originating from a\) escap- 
ing the two relevant jet cones, while Pj(p) makes an angle 

of dj with p and a\ has a boost of j ai and velocity f3 ai 
relative to the Higgs, then 



Pjoip) = 7oiPj (p)(l + Pax COS0j) 
m,. 



2m 
= B{B\) 



1 



1 



2m a 
m h 



cos0,- p,» 



Pi ip) 



(12) 



In the frame where a\ has no momentum in the z di- 
rection and the Higgs has no transverse momentum (the 
"transverse frame"), given an ultra-relativistic particle 
making an angle of a T with p T (the transverse compo- 
nent of p) it makes, in the rest frame of a\ an angle of a 
with p T and 



cos a 



cos a 



1 + Kx cos ' 



(13) 



where 0£ is the boost of ai in the transverse frame. If 
the Higgs has no transverse momentum and p makes an 
angle * with the beam axis, then 



/3 ai sin* 



1 - PI 



(14) 



We model the union of the two b hadron cones as a 
single cone of radius R a (in rj-<fi space) centered on the 
scalar. In the transverse frame, this cone is approxi- 
mately a cone in cartesian space with a^ax = R a - 

Now, the particles from the underlying event are ap- 
proximately uniformly distributed over 77, so in the trans- 
verse frame they are isotropically distributed over small 
a T (since Ai] is relatively insensitive to boost along the 
beam axis and r\ ~ (tt/2 — 9) for 7r/2 — 9 small). The 



particles from the underlying event which contribute to 
measurement error fall into the jet cone. We therefore 
use eqns. [Til H31 an d HH to write A(4> ) in terms of <j) T 
(the angle between the particle from the underlying event 

TABLE II: A and B using R a = 0.85 for various points in 
parameter space. 



m h 
m„. 


A 


B 


4.0 


4.6 


1.8 


5.0 


4.9 


1.7 


6.0 


5.1 


1.5 


7.0 


5.2 


1.4 


8.0 


5.2 


1.3 


9.0 


5.2 


1.3 



and the scalar in the transverse frame) and * (the angle 
between the scalar and the beam axis in the rest frame of 
the Higgs), averaging over < (j> T < R a and < $ < tt, 
using 



A 



Jo d * I Ha d^Mf , *) sin * sin < 
fo<Mf* a d(t> T sin* sin (j> T 



(15) 



Meanwhile, daughter particles of a\ are (over an en- 
semble of decays) distributed isotropically in the refer- 
ence frame of a%. In this frame, the jet covers an area 
from a = to a' max = a max {aj naxl ^) = a' max (R a ,^) 
using eqns. [T3]and[T31 The daughter particles which con- 
tribute to measurement error are those which fall outside 
this cone. We average over a max (R a , *) < 9 < tt and 
< * < tt: 



C d* P, 

■jt JO Ja. 

_D = 



d6'B( 9' )smVsm9' 



lo^Ia' de ' Sil1 * Sil1 °' 



(16) 



Finally, we note that the above calculations hold true 
for mis-measurements of the second pseudo-scalar's 4- 
momentum and their effects on the measurement of m a 
and mh- 

If the error in the measurement in m a was entirely 
due to particles from the underlying event then we could 
approximate Amj, ~ Am a A. If it was entirely due to 
daughter particles of the scalars escaping the jet cones 
then Anih — Am a B. Table [TT1 gives values for A and B 
for different mass ratios ^ assuming R a — 0.85 (slightly 

larger than the individual cone size). We can expect 

to lie somewhere between these two extremes. Note that 

A > 3.0 while B < 3.0. 



